55 research outputs found

    Multi-scale support vector algorithms for hot spot detection and modelling

    Get PDF
    The algorithmic approach to data modelling has developed rapidly these last years, in particular methods based on data mining and machine learning have been used in a growing number of applications. These methods follow a data-driven methodology, aiming at providing the best possible generalization and predictive abilities instead of concentrating on the properties of the data model. One of the most successful groups of such methods is known as Support Vector algorithms. Following the fruitful developments in applying Support Vector algorithms to spatial data, this paper introduces a new extension of the traditional support vector regression (SVR) algorithm. This extension allows for the simultaneous modelling of environmental data at several spatial scales. The joint influence of environmental processes presenting different patterns at different scales is here learned automatically from data, providing the optimum mixture of short and large-scale models. The method is adaptive to the spatial scale of the data. With this advantage, it can provide efficient means to model local anomalies that may typically arise in situations at an early phase of an environmental emergency. However, the proposed approach still requires some prior knowledge on the possible existence of such short-scale patterns. This is a possible limitation of the method for its implementation in early warning systems. The purpose of this paper is to present the multi-scale SVR model and to illustrate its use with an application to the mapping of Cs137 activity given the measurements taken in the region of Briansk following the Chernobyl acciden

    The analysis of kernel ridge regression learning algorithm.

    Get PDF
    The paper presents Kernel Ridge Regression, a nonlinear extension of the well known statistical model of ridge regression. New insights on the method are also presented. In particular, the connection between ridge regression and local translation-invariant squared loss minimization algorithm is shown. An iterative training algorithm is proposed, that allows training the KRR for large datasets. The training time is empirically found to scale quadratically with the number of samples. The application of the model is illustrated on the real datasets

    Habitualisation: localisation without location data

    Get PDF
    This paper looks at identifying the locations of users from the Nokia MDC dataset throughout the day without taking into consideration location based data. By looking at a users habits and idiosyncrasies we determined the likelihood of a users location within known stay regions which we call habitats. The features used to determine location were extracted from a users interaction with the smart phone. None of the features contained a users locations or a users proximity to objects with known locations. Using a set of structured output support vector learning techniques we found that a users location with respect to the areas of typical activities is well predictable solely from daily routines and a smart phone usage habits

    Pattern-Based Sensing of Nucleotides in Aqueous Solution with a Multicomponent Indicator Displacement Assay

    Get PDF
    A multicomponent indicator displacement assay ( MIDA) based on an organometallic receptor and three dyes can be used for the identification and quantification of nucleotides in aqueous solution at neutral pH

    Learning wind fields with multiple kernels

    Get PDF
    This paper presents multiple kernel learning (MKL) regression as an exploratory spatial data analysis and modelling tool. The MKL approach is introduced as an extension of support vector regression, where MKL uses dedicated kernels to divide a given task into sub-problems and to treat them separately in an effective way. It provides better interpretability to non-linear robust kernel regression at the cost of a more complex numerical optimization. In particular, we investigate the use of MKL as a tool that allows us to avoid using ad-hoc topographic indices as covariables in statistical models in complex terrains. Instead, MKL learns these relationships from the data in a non-parametric fashion. A study on data simulated from real terrain features confirms the ability of MKL to enhance the interpretability of data-driven models and to aid feature selection without degrading predictive performances. Here we examine the stability of the MKL algorithm with respect to the number of training data samples and to the presence of noise. The results of a real case study are also presented, where MKL is able to exploit a large set of terrain features computed at multiple spatial scales, when predicting mean wind speed in an Alpine regio

    Tangent Vector Kernels for Invariant Image Classification with SVMs

    Get PDF
    This paper presents an application of the general sample-to-object approach to the problem of invariant image classification. The approach results in defining new SVM kernels based on tangent vectors that take into account prior information on known invariances. Real data of face images are used for experiments. The presented approach integrates virtual sample and tangent distance methods. We observe a significant increase in performance with respect to standard approaches. The experiments also illustrate (as expected) that prior knowledge becomes more important as the amount of training data decreases

    Invariances in Kernel Methods: From Samples to Objects

    Get PDF
    This paper presents a general method for incorporating prior knowledge into kernel methods such as Support Vector Machines. It applies when the prior knowledge can be formalized by the description of an object around each sample of the training set, assuming that all points in the given object share the same desired class. A~number of implementation techniques of this method, based on hard geometrical objects and soft objects based on distributions are considered. Tangent vectors are extensively used for object construction. Empirical results on one artificial dataset and two real datasets of Electro-Encephalogram signals and face images demonstrate the usefulness of the proposed method. The method could establish a foundation for an information retrieval and person identification systems

    From Samples to Objects in Kernel Methods

    Get PDF
    This paper presents a general method for incorporating prior knowledge into kernel methods. It applies when the prior knowledge can be formalized by the description of an object around each sample of the training set, assuming that all points in the given object share the same desired class. Two implementation techniques of this method, based on analytical kernel jittering and the vicinal risk minimization principle, are considered. Empirical results on one artificial dataset and one real dataset based on EEG signals demonstrate the performance of the proposed method

    Stratification structure of urban habitats

    Get PDF
    This paper explores the community structure of a network of significant locations in cities as observed from location-based social network data. We present the findings of this analysis at multiple spatial scales. While there is previously observed distinct spatial structure at inter-city level, in a form of catchment areas and functional regions, the exploration of in-city scales provides novel insights. We present the evidence that particular areas in cities stratify into distinct “habitats” of frequently visited locations, featuring both spatially overlapping and disjoint regions. We then quantify this stratification with normalized mutual information which shows different stratification levels for different cities. Our findings have important implications for advancing models of human mobility, studying social exclusion and segregation processes in cities, and are also of interest for geomarketing analysts developing fidelity schemes and promotional programmes
    corecore